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Abstract 

This is an expository account of the proof of Kontsevich's combi- 
natorial formula for intersections on moduli spaces of curves following 
the paper ^Ij. It is based on the lectures I gave on the subject in St. 
Petersburg in July of 2001. 

These are notes from the lectures I gave in St. Petersburg in July of 
2001. Our goal here is to give an informal introduction to intersection theory 
on the moduli spaces of curves and its relation to random matrices and 
combinatorics. More specifically, we want to explain the proof of Kontsevich's 
formula given in jHj and how it is connected to other topics discussed at 
this summer school such as, for example, the combinatorics of increasing 
subsequences in a random permutations. 

These lectures were intended for an audience of mostly analysts and com- 
binatorialists interested in asymptotic representation theory and random ma- 
trices. This is very much reflected in both the selection of the material and 
its presentation. Since absolutely no background in geometry was assumed, 
there is a long and very basic discussion of what moduli spaces of curves and 
intersection theory on them are about. We hope that a reader trained in 
analysis or combinatorics will get some feeling for moduli of curves (without 
worrying too much about the finer points of the theory, all but a few of which 
were swept under the rug). 

Conversely, in the second, asymptotic, part of the text, I allowed myself 
to operate more freely because the majority of the audience was experienced 
in asymptotic analysis. Also, since many fundamental ideas such as e.g. the 
KdV equations for the double scaling limit of the Hermitian 1-matrix model 
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were at length discussed in other lectures of the school, their discussion here 
is much more brief that it would have been in a completely self-contained 
course. A much more detailed treatment of both geometry and asymptotics 
can be found in the paper ITJ], on which my lectures were based. 

It is needless to say that, since this is an expository text based on my 
joint work with Rahul Pandharipande, all the credit should be divided while 
any blame is solely my responsibility. Many people contributed to the success 
of the St. Petersburg summer school, but I want to especially thank A. Ver- 
shik for organizing the school and for the invitation to participate in it. I 
am grateful to NSF (grant DMS-0096246), Sloan foundation, and Packard 
foundation for partial financial support. 

1 Introduction to moduli of curves 
1.1 

Let me begin with an analogy. In the ideal world, the moduli spaces of curves 
would be quite similar to the Grassmann varieties 

Grk^n = {L(1 C",dimL = k} 

of fc-dimensional linear subspaces L of an n-dimensional space. While any 
such subspace L is geometrically just a fc-dimensional vector space C'^, non- 
trivial things can happen if L is allowed to vary in families, and this nontriv- 
iality is captured by the geometry of Grk,n- 

A convenient formalization of the notion of a family of linear spaces pa- 
rameterized by points of some base space S is a (locally trivial) vector bundle 
over B. There is a natural tautological vector bundle over the Grassmannian 
Grfc „ itself, namely the space 

L = {{L,v),ve L C C"} 

formed by pairs (L, v), where L is a fc-dimensional subspace of C" and f is a 
vector in L. Forgetting the vector v gives a map L —>■ Grk,n whose fiber over 
L G Grk^n is naturally identified with L itself. 
Given any space B and a map 

(j): B ^ Grk,n 
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we can form the pull-back of L 

(fL = {{b,v),be B,v eC'\v e 0(6)} 

which is a rank k vector bundle over B. For a compact base B, in the 
n — > oo limit this becomes a bijection between (homotopy classes) of maps 
B — s> Gvk^oD and (isomorphism classes) of rank k vector bundles on B. 

In particular, one can associate to a vector bundle 0*£ its characteristic 
cohomology classes obtained by pulling back the elements of H*{Grk,n) via 
the map 0. Intersections of these classes describe the enumerative geometry 
of the bundle It is thus especially important to understand the inter- 

section theory on the space Gr^ „ itself — and this leads to a very beautiful 
classical combinatorics, in particular, Schur functions play a central role (see 
for example Chapter 14). 

1.2 

One would like to have a similar theory with families of linear spaces replaced 
by families of curves of some genus g. That is, given a family F of, say, smooth 
genus g algebraic curves parameterized by some base B we want to have a 
natural map (j) : B ^ Mg that captures the essential information about the 
family F. Here JAg is the moduli space of smooth curves of genus g, that is, 
the space of isomorphism classes of smooth genus g curves. At this point it 
may be useful to be a little naive about what we mean by a family of curves 
etc., in this way we should be able to understand the basic issues without 
too many technicalities getting in our way. Basically, a family F of curves 
with base i? is a "nice" morphism 

Ti:F^B 

of algebraic varieties whose fibers 7r^^(6), b ^ B, are smooth complete genus 
g curves. We want the moduli space and the induced map (p : B ^ Mg 
to be also algebraic. 

We will see that the first difficulty with the above program is that, in 
general, the family F will not be a pull-back of any universal family over M^. 
To get a sense of why this is the case we can cheat a little and consider the 
(normally forbidden) case g = 0. Up to isomorphism, there is only one curve 
of genus 0, namely the projective line P^. Hence the map in this case can 
only be the trivial map to a point. There exist, however, highly nontrivial 
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families with fibers isomorphic to or even as we, in fact, already saw 
above in the example of the tautological rank 1 bundle over Gri n — P""^. 

The reason why there exist locally trivial yet globally nontrivial families 
with fiber is that P^ has a large automorphism group which one can use 
to glue trivial pieces in a nontrivial way. Basically, the automorphisms are 
the principal issue behind the nonexistence of a universal family of curves 
over Mg. The situation becomes manageable, if not entirely perfect, once one 
can get the automorphism group to be finite (which is automatic for smooth 
curves of genus g > 1). A standard way to achieve this is to consider curves 
with sufficiently many marked points on them (> 3 marked points for = 
and > 1 ioT g — 1). Since curves with marked points arise very naturally in 
many other geometric situations, the moduli spaces M^, „ of smooth genus g 
curves with n distinct marked points should be considered on equal footing 
with the moduli spaces Mg of plain curves. 

1.3 

As the first example, let us consider the space Mi^i of genus g = 1 curves C 
with one parked point p E C. By Riemann-Roch the space of H^{C, 0{2p)) 
of meromorphic functions on C with at most double pole at p has dimen- 
sion 2. Hence, in addition to constants, there exists a (unique up to linear 
combinations with constants) nonconstant meromorphic function 

with a double pole at p and no other poles (this is essentially the Weier- 
strafi function p). Thus, / defines at a 2-fold branched covering of P^ doubly 
ramified over oo e P^. For topological reasons, it has three additional ram- 
ification points which, after normalization, we can take to be 0, 1 and some 
A e pi \ {0,l,oo}. 

Now it is easy to show that such a curve C must be isomorphic to the 
curve 

C ^ {y^ = x{x- l)(x- X)} (1) 

in such a way that the point p becomes the unique point at infinity and the 
function / becomes the coordinate function x. It follows that every smooth 
pointed g — 1 curves occurs in the following family of curves 

F = {{x,y,X),y^ = x{x - l){x - X)} (2) 
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with the base 

B = {A} = P^\{0,l,oo}, 

where the marked point is the point p at infinity. For example, the curve ((H) 
corresponding to A = | is plotted in Figure [TJ 



2-1 




-2-1 



Figure 1: The A = | member of the family (j21). 

However, a given curve C occurs in the family (jSj) more than once. Indeed, 
we made arbitrary choices when we normalized the 3 critical values of / to 
be 0, 1, and A, respectively. At this stage, we can choose any of 6 = 3! 
possible assignments which makes the symmetric group 5'(3) act on the base 
B preserving the isomorphism class of the fiber. Concretely, this group is 
generated by involutions 

A ^ 1 - A, 

which interchanges the roles of and 1, and by 

A ^ 1/A, 

exchanging the roles of 1 and A. It can be shown that two members of the 
family F are isomorphic if and only if they belong to the same 5'(3) orbit. 
Thus, the structure map : 5 — > Mi^i should be just the quotient map 

(f): B/S{3) = SpecC[fi]^(=^) . 
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Here C[S]'^^^) is the algebra of S'(3)-invariaiit regular functions on B. This 
algebra is a polynomial algebra with one generator, the traditional choice for 
which is the following 



A2(A-1)2 ■ 
Thus, Mi,i is simply a hne 

Mi,i = SpecC[7-] ^C. 

It is now time to point out that the family F is not a pull-back 0* of 
some universal family over Mi^i. The simplest way to see this is to observe 
that the group S{3) fails to act on F. Indeed, let us try to lift the involution 
A ^ 1 — A from B to F. There are two ways to do this, namely 

(x, y. A) 1-^ (1 - X, ±iy, 1 - A) , 

neither of which is satisfactory because the square of either map 

{x,y,X) ^ {x,-y,X) 

yields, instead of identity, a nontrivial automorphism of every curve in the 
family F. One should also observe that both choices act by a nontrivial 
automorphism on the fiber over the fixed point A = | G -B. In fact, the fibers 
of F over fixed points of a transposition and a 3-cycle in S{3), respectively, 
(with j(A) = 1728 and j(A) = 0, resp.) are precisely the curves with extra 
large automorphism groups (of order 4 and 6, resp.). 

The existence of an nontrivial automorphism of every pointed genus 1 
curve leads to the somewhat unpleasant necessity to consider every point of 
Mi,i as a "half-point" in some suitable sense in order to get correct enu- 
merative predictions. Again, automorphisms make the real world not quite 
ideal. 

While it is important to be aware of these automorphism issues (for ex- 
ample, to understand how intersection numbers on moduli spaces can be 
rational numbers), there is no need to be pessimistic about them. In fact, by 
allowing spaces more general than algebraic varieties (called stacks) one can 
live a life in which Mp,„ is smooth and with a universal family over it. This 
is, however, quite technical and will remain completely outside the scope of 
these lectures. 
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1.4 



Clearly, the space Mi^i = C is not compact. The j-invariant of the curve 
goes to oo as the parameter A approaches the three excluded points {0, 1, oo}. 
As A approaches or 1, the curve C acquires a nodal singularity; for example, 
for A = 1 we get the curve 

y'^ = x{x — 1)^ 

plotted in Figure El It is natural to complete the family (0) by adding the 




Figure 2: The nodal A = 1 curve in the family (j^J. 

corresponding nodal cubics for A G {0, 1}. All plane cubic with a node being 
isomorphic, the function j extends to a map 

J : C ^ pi/5(3) = ^ 

to the moduli space of Mi^i of curves of arithmetic genus 1 with at most one 
node and a smooth marked point. ^ 

In general, it is very desirable to have a nice compactification for the 
noncompact spaces Mg^„. First of all, interesting families of curves over a 

^For future reference we point out that something rather different happens in the family 
(0) as A — > oo. Indeed, the equation 
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complete base B are typically forced to have singular fibers over some points 
in the base (as in the example above). Fortunately, as we will see below, 
it often happens that precisely these special fibers contain key information 
about the geometry of the family. Also, since eventually we will be interested 
in intersection theory on the moduli spaces of curves, having a complete space 
can be a significant advantage. 

A particularly remarkable compactification Mg^„ of Mg^„ was constructed 
by Deligne and Mumford. The space M-g^n is the moduli space of stable 
curves C of arithmetic genus g with n distinct marked points. Stability, by 
definition, means that the curve C is complete and connected with at worst 
nodal singularities, all marked points are smooth, and that C, together with 
marked points, admits only finitely many automorphisms. In practice, the 
last condition means that every rational component of the normalization of C 
should have at least 3 special (that is, lying over marked or singular points of 
C) points. Observe that, in particular, the curve C is allowed to be reducible. 
A typical stable curve can be seen in Figure El 




Figure 3: A boundary element of Mi, 5 



becomes x(x — 1) = in the X 00 limit, which means that we get three Hncs (one of 
which is the hne at infinity), all three of them intersecting in the marked point at infinity. 
In other words, the fiber of the family (O at A = 00 is very much not the kind of curve by 
which we want to compactify Mi,i. This problem can be cured, but in a not completely 
trivial way, see below. 
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1.5 



Those who have not seen this before are probably left wondering how it is 
possible for Mg,„ to be compact. What if, for example, one marked point pi 
on some fixed curve C approaches another marked point p2 & C 7 We should 
be able to assign some meaningful stable limit to such a 1-parametric family 
of curves, but it is somewhat nontrivial to guess what it should be. 

A family of curves with a 1-dimensional base S is a surface F together 
with a map n : F ^ B whose fibers are the curves of the family. Marking n 
points on the curves means giving n sections of the map vr, that is, n maps 

Pu...,Pn: B ^ F , 

such that 

'^{Pkib))=b, beB, k=l,...,n. 

We will denote by 

S^=P^iB) 

the trajectories of the marked points on F; they are curves on the surface F. 

Now suppose we have a 1-dimensional family of 2-pointed curves such 
that at some bad point bo of the base B we have Pi(&o) = ^2(^0); that is, over 
this point two marked points hit each other, see Figured and therefore the 
fiber 7r~^(6o) is not a stable 2-pointed curve. It is quite easy to repair this 




B b, 



Figure 4: A family with colliding marked points 

family: just blow up the offending (but smooth) point P = Pi(6o) = ^2(^0) 
on the surface F. Let 

a: F ^ F 
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be the blow-up at P. Then 



-K = -Koa: F^B 

is new family of curves with base B. Outside 60 this the same family as before, 
whereas the fiber 7r~^(6o) is the old fiber 7r~^(6o) plus the exceptional divisor 
E = a~^{P) = of the blow-up, see Figure El Assuming the sections 5*1 




B 



Figure 5: Same family, except with collision point blown up 

and ^2 met each other and the fiber 7r~^(6o) transversally at P, as in Figure 
m the marked points on 7r~^(6o) are two distinct point on the exceptional 
divisor E. Therefore, 7f~^(6o) is a stable 2-pointed curve which is the stable 
limit of the curves vr^^(6) as 6 — 6o- 

To summarize, if one marked point on a curve C approaches another then 
C bubbles off a projective line with these two points on it as in Figure IHl 




Figure 6: Bubbling off a projective line 

More generally, if F is any family of curves with a smooth 1-dimensional 
base B that are stable except over one offending point Bq & B then after 
a sequence of blow-ups and blow-downs and, possibly, after passing to a 
branched covering of B, one can always arrive at a new family with all fibers 
stable. Moreover, the fiber over 60 in this family is determined uniquely. 
This process is called the stable reduction and how it works is explained, for 
example, in Chapter 3C. 
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In particular, there exists a stable reduction of the family (j2)) which, as 
we saw in Section 11.41 fails to have a stable fiber over the point A = oo in 
the base. This is an example where only blow-ups and blow-downs will not 
suffice, that is, a base change is necessary. 

1.6 

The topic of this lectures is intersection theory on the Deligne-Mumford 
spaces Mg^n and, specifically, intersections of certain divisors ipi which will 
be defined presently. It was conjectured by Witten jTHI that a suitable gen- 
erating function for these intersections is a r-function for the Korteweg-de 
Vries hierarchy of differential equations. This conjecture was motivated by 
an analogy with matrix models of quantum gravity, where the same KdV hi- 
erarchy appears (this was already discussed in other lectures at this school). 
The KdV equations were deduced by Kontsevich in [H] from an explicit com- 
binatorial formula for the intersections of the ^/'-classes (see also, for example, 
[2] for more information about the connection to the KdV equations). The 
main goal of these lectures is to explain a proof of this combinatorial formula 
of Kontsevich following the paper jT^ . 

The definition of the divisors ipi is the following. A point in Mg_„ is 
a stable curve C with marked points By definition, all points 

Pi are smooth points of C, hence the tangent space Tp-C to C at pi is a 
line. Similarly, we have the cotangent lines T*^C, i = 1, . . . ,n. As the point 
(C,pi, . . . ,pn) € ^g,n varics, these cotangent lines T*.C form n line bundles 
over Mg^„. By definition, ipi is the first Chern class of the line bundle T*^C. 
In other words, it is the divisor of any nonzero section of the line bundle 
T*C. 

1.7 

To get a better feeling for these classes let us intersect ipi with a curve in 
JAg^n- The answer to this question should be a number. Let _B be a curve. 
A map B — > Mg^„ is morally equivalent to a 1-dimensional family of curves 
with base B (in reality we may have to pass to a suitable branched covering 
of B to get an honest family. ^) So, let us consider a family vr : F — i? of 

^We already saw an example of this in Section Indeed, Mi.i is itself a line P^. 
However, in order to get an actual family over it we have to go to a branched covering. 
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stable pointed curves with base B and the induced map (p : B ^ As 
usual, the marked points pi, . . . ,p„ are sections of vr and 



l,...,n, 



Si=pi{B), i 

are disjoint curves on the surface F. A section s of (j)*{Tp-) is a vector field 




B 



Figure 7: The self-intersection of Si = Pi{B) on F. 

on the curve Si which is tangent to fibers of vr and, hence, s is a section of 
the normal bundle to Si C F, see Figure [3 The degree of this normal bundle 
is the self-intersection of the curve 5*^ on the surface F, that is, 

deg (s) = {Si, Si)F , 

where (s) is the divisor of s. In other words, 



where Ci denotes the 1st Chern class. Dually, we have 



= -{Si, Si)F ■ 

We will now use this formula to compute the intersections of ipi with Mp^„ 
in the cases when the space Mg^„ is itself 1-dimensional. Since 



dim M.g^n = 3g — 3 + n , 

this happens for Mo,4 and Mi,i. 

As always, the automorphisms are to blame. 



(3) 
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1.8 



The space Mo,4 is easy to understand. After all, there is only one smooth 
curve of genus 0, namely P^. Moreover, any 3 distinct points of can 
be taken to the points {0, 1, oo} by an automorphism of P^ (in particular, 
this means that Mo,3 is a point). After we identified the first three marked 
points with {0, 1, oo}, we can take any point x G P^ \ {0, 1, oo} as the fourth 
marked point. Thus the locus of smooth curves in Mo,4 is isomorphic to 
P^ \ {0,1, oo}. Singular curves are obtained as we let x approach the 3 
excluded points {0,1, oo}, which, by the process described in Section [T31 
bubbles off a new P^ with two marked points on it. This completes the 
description of Mo,4 = P^ 

In addition, this gives a description of the universal family over Mo,4 (oh 
yes, in genus it does exist !). Take P"*^ x P^ with coordinates {x,y). The 
map {x, y) ^ X with 4 sections 

Pi(x) = (x, 0) , p2(a;) = (x, 1) , p3(x) = (x, oo) , ^4(3;) = (x, x) , 

defines a family of 4-pointed smooth genus curves for x G P^ \ {0, 1, 00}. 
The section collides with the other three at the points (0,0), (1, 1), and 
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Figure 8: The trivial family P x P with sections pi, . . . ,P4. 



(00, 00), see Figure IHl To extend this family over all of P^, we blow up these 
collision points as in Section lT31 and get the surface F shown in FigurelHl The 
closures of the curves pi(x), . . . ,Pi{x) give 4 sections which are now disjoint 
everywhere. 

Incidentally, this surface F can be naturally identified with Mo,5 and, 
more generally, for any n there exists a natural map Mo,„+i Mo,n giving the 
universal family over Mo,n- This map forgets the (n + l)st marked point and, 
if the curve becomes unstable after that, blows down all unstable components. 
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7 oo 

Figure 9: The universal family over Mo,4. 

Now let us compute ^ ipi using the recipe given in Section 11.71 Recall 
that Si C F denotes the closure of the curve {(x,0)}, x ^ in F (a.k.a. 
the proper transform of the corresponding curve in x P^). Let E denote 
the preimage of (0, 0) under the blow-up, that is, let E be the exceptional 
divisor. The self-intersection of 5*1 with any curve {y = c},c^ 0, on X pi 
is clearly zero. Letting c ^ we get 

{Si,E + Si) = 0. 

Since, obviously, {Si,E) = 1 we conclude that 

/ iji = -{SuSi) = -{-l) = l. 

J]V[o,4 

1.9 

Now let us analyze the integral ^tpi. In the absence of a universal family, 
we have to look for another suitable family to compute this integral. A 
particularly convenient family can be obtained in the following way. Consider 
the projective plane P^ with affine coordinates {x,y). Pick two generic cubic 
polynomials f{x, y) and g{x, y) and consider the family of cubic curves 

F = {(x, y, t), fix, y) - tg{x, = 0} C P' x P\ (4) 

with base S = P^ parameterized by t. The cubic curves f{x,y) = and 
g{x,y) = intersect in 9 points pi, . . . ,Pq and we can choose any of those 
points as the marked point in our family. An example of such family of plane 
cubics is plotted in Figure UHl 
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Figure 10: A pencil of plane cubics. 



Our first observation is that the surface F is the blow-up of at the 
points pi, . . . ,p9 (which are distinct for generic / and g). Indeed, we have a 
rational map 



which is regular away from pi, . . . ,^9. Each of the points pi is a transverse 
intersection of / = and g = which is another way of saying that at 
those points the differentials df and dg are linearly independent. Thus, this 
map identifies F with the blow-up of at pi, . . . ,^9. The graph of the 
function is shown in Figure ^2 This graph goes vertically over the 

points pi, . . . ,p9 that are blown up. 

Since the section 5*1 is exactly one of the exceptional divisors of this 
blow-up, arguing as in Section ITTHl above we find that 



It does not mean, however, that we are done with the computation of the 
integral, because the induced map : i? — >■ Mi^i is very far from being one- 
to-one. In fact, set-theoretically, the degree of the map is 12 as we shall 
now see. 

To compute the degree of we need to know how many times a fixed 
generic elliptic curve appears in the family F . This is a classical computation. 
First, one can show that the singular cubic is generic enough. Then we claim 
that, as t varies, there will be precisely 12 values of t that produce a singular 




(5*1, ^i) 



-1. 
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Figure 11: A fragment of the surface (@)) 



curve. There are various ways to see this. For example, the singularity of 
the curve is detected by vanishing of the discriminant. The discriminant of 
a cubic polynomial is a polynomial of degree 12 in its coefficients, hence a 
polynomial of degree 12 in t. 

A alternative way to obtain this number 12 is to compute the Euler 
characteristic of the surface F in two different ways. On the one hand, 
viewing F as a blow-up, we get 

x(F) = x(P') + 9(x(P')-l) = 12. 

On the other hand, F is fibered over B and the generic fiber is a smooth 
elliptic curve whose Euler characteristic is 0. The special fibers are the nodal 
elliptic curves with Euler characteristic equal to 1. Hence, there are 12 special 
fibers. 

However, as remarked in Section II. 3( each point of Mi,i is really a half- 
point because of automorphism of order 2 of any pointed genus 1 curve. 
Therefore, the 24 = 2 ■ 12 is the true degree of the map 0. By the push-pull 
formula we thus obtain 

deg^Js deg0 24 
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An interesting corollary of this computation is that if F ^ 5 is a smooth 
family of 1-pointed genus 1 stable curves over a smooth complete curve B 
then the set-theoretic degree of the induced map B Mi i has to be divisible 
by 12. 

1.10 

It is difficult to imagine being able to compute many intersections of the 
■j/'-classes in the above manner. To begin with, it is essentially impossible 
to write down a sufficiently explicit family of general high genus curves, see 
the discussion in Chapter 6F of |H]. It is therefore amazing that there exist 
several complete and beautiful descriptions of the all possible intersection 
numbers of the form 

{Tk,...nJ= [_ h + --- + K = 3g-3 + n. (5) 

The most striking description was conjectured by Witten ^\ and says the 
exponential of the following generating function for the numbers (0) 

k \ ^ . . . ^kfi 

is a r-function for the KdV hierarchy of differential equations. This conjec- 
ture was motivated by the (physical) analogy with the random matrix models 
of quantum gravity and, in fact, the r-function thus obtained is the same as 
the one that arises in the double scaling of the 1-matrix model (and discussed 
in other lectures of this school) . The KdV equation and the string equation 
satisfied by the r-function uniquely determine all numbers Alterna- 
tively, the numbers Q are uniquely determined by the associated Virasoro 
constraints. Further discussion can be found, for example, in 0. 

1.11 

Kontsevich in obtained the KdV equations for from a combinatorial 
formula for the following (somewhat nonstandard) generating function 

r. , ^ V- / V TT (2A;i - 1)!! 

ki + ---+kn=3g-3+n 
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for the numbers (0) with fixed g and n. 

The main ingredient in Kontsevich's combinatorial formula is a 3-valent 
graph G embedded in the a topological surface S^. A further condition on 
this graph G is that the complement '^g\G is a union of n topological disks (in 
particular, this forces G to be connected). These disks, called cells, have to 
(bijectively) numbered by 1, . . . , n. Two such graphs G and G' are identified 
if there exist an orientation preserving homeomorphism of that takes G 
to G' and preserves the labels of the cells. In particular, every graph G has 
an automorphism group Aut G, which is finite and only seldom nontrivial. 
Let „ denote the set of distinct such graphs G with given values of g and 
n; this is a finite set. An example of an element of is shown in Figure 



Another name for a graph G C such that \ G is a union of cells is 
a map on S^. One can imagine that the cells are the countries in which the 
graph G divides the surface S^. 

Kontsevich's combinatorial formula for the function (|7j) is the following: 



is the following. Each edge e of G separates two cells (which may be identi- 
cal). These cell carry some labels, say, i and j. Then {zi + Zj)~^ is the factor 
in dHI) corresponding to the edge e. 




Figure 12: A trivalent map on a genus 2 surface 




where the meaning of the term 



•^one side of e ~l~ •Mother side of e 
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To get a better feeling for how this works let us look at the cases {g, n) = 
(0, 3), (1, 1) that we understand well. The space Mo,3 is a point and the only 
nontrivial integral over it is 



{tq To To) = /_ 1 

'Mo,3 



Thus, 



1 



0,3 



2^1 ^3 

The combinatorial side of Kontsevich's formula, however, is not quite trivial. 
The set Gq3 consists of 4 elements. Two of them are shown in Figure El the 





Fi gure 13: Elements of Gq3 

other two are obtained by permuting the cell labels of the graph of the left. 
All these graphs have only trivial automorphisms. Hence, we get 



Ko,3 



2Zi{Zi + Z2)iZi + Zs) 



+ permutations 



+ 



{Zi + Z2){zi + Z3){Z2 + Zs] 



and, indeed, this simplifies to {ziZ2Z3)~^. What is apparent in this example is 
that it is rather mysterious how (jS)), which a priori is only a rational function 
of the ZiS, turns out to be a polynomial in the variables z^ . 

Perhaps this example created a somewhat wrong impression because in 
this case (jHI) was much more complicated than 0. So, let us consider the 
case {g,n) = (1, 1), where the computation of the unique integral 



<^'> = 24 
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Figure 14: The unique graph in G'^ 



already does require some work. The unique element of G"^ i is shown in 
Figure This graph can be obtained by gluing the opposite sides of a 
hexagon, which also explains why the automorphism group of this graph is 
the cyclic group of order 6 (acting by rotations of the hexagon). Thus, (jHI) 
specializes in this case to 

2I ' =11 

6 {zi + 24 zf ' 

as it should. 
1.12 

Kontsevich was led to the formula (jH)) by considering a cellular decomposition 
of 'Mg^n coming from Strebel differentials. In these lectures we shall explain, 
following [14 , different approach to the formula (jH} via the asymptotics in 
the Hurwitz problem of enumerating branched covering of P^. This approach 
is based on the relation between the Hurwitz problem and intersection theory 
on Mg^„ discovered in j^l E] and on the asymptotic analysis developed in [TU] . 
It has several advantages over the approach based on Strebel differentials. 

2 Hurwitz problem 
2.1 

Intersection theory on Mg_„ is about enumerative geometry of families of sta- 
ble n-pointed curves of genus g. The significance of the space Mg^„ is that 
its geometry captures some essential information about all possible families 
of curves. Through the space Mg,„, one can learn something about curves 
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in general from any specific enumerative problem. If the specific enumera- 
tive problem is sufficiently rich, one can gather a lot of information about 
intersection theory on Mg^„ from it. Potentially, one can get a complete un- 
derstanding of the whole intersection theory, which then can be applied to 
any other enumerative problem. 

Our strategy will be to study such a particular yet representative enu- 
merative problem. This specific problem will be the Hurwitz problem about 
branched covering of P^. That there exists a direct connection between Hur- 
witz problem and the intersection theory on M,g^n was first realized in [3 H] . 
The beautiful formula of |3] for the Hurwitz numbers will be the basis for 
our computations. 

In fact, we will see that the (exact) knowledge of the numbers (0) is 
equivalent to the asymptotics in the Hurwitz problem. This is, in some 
sense, very fortunate because asymptotic enumeration problems often tend 
to be more structured and accessible than exact enumeration. 

2.2 

It is a century-old theme in combinatorics to enumerate branched coverings 
of a Riemann surface by another Riemann surface (an example of which is 
shown schematically in Figure IT6|) . Given degree d, positions of ramification 
points downstairs, and their types (that is, given the conjugacy class in S{d) 
of the monodromy around each one of them), there exist only finitely many 
possible coverings and the natural question is: how many ? This very basic 
enumerative problem arises all over mathematics, from complex analysis to 
ergodic theory. These numbers of branched coverings are directly connected 
to other fundamental objects in combinatorics, namely to the class algebra 
of the symmetric group and — via the representation theory of finite groups 
— to the characters of symmetric groups. 

We also mention that there is a general, and explicit, correspondence 
between enumeration of branched covering of a curve and the the Gromov- 
Witten theory of the same curve, see ^Hl- From this point of view, the 
computation of the numbers (0), that is, the Gromov-Witten theory of a 
point, arises as a limit in the Gromov-Witten theory of as the degree goes 
to infinity. This is parallel to how the free energy (jH)) equation arises as the 
limit in the 1-matrix model. 



21 



2.3 



The particular branched covering enumeration problem that we will be con- 
cerned with can be stated as follows. The data in the problem are a partition 
fi and genus g. Let 

/ : C ^ 

be a map of degree 

where C is smooth connected complex curve of genus g. We require that 
cxD G is a critical value of the map / and the corresponding monodromy 
has cycle type /x. Equivalently, this can be phrased as the requirement that 
divisor /~^(oo) has the form 

n 
1=1 

where n = is the length of the partition /i and pi, . . . ,Pn G C are the 
points lying over oo G P^. We further require that all other critical values 
of / are distinct and nondegenerate. In other words, the map /^^ has only 
square- root branch points in P^ \ {00} . The number r of such square-root 
branch points is given by the Riemann-Hurwitz formula 

r = 2g-2 + \fx\+i{fx). (9) 

An example of such a covering can be seen in Figure IT31 where fi = (3) and 
r = 2, hence d = 3, n = 1, and g = 0. 




Figure 15: A Hurwitz covering with fi = (3) 
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We will call a covering satisfying the above conditions a Hurwitz covering. 
Once the positions of the r simple branchings are fixed, there are only finitely 
many Hurwitz coverings provided we identify two coverings 

/ : C ^ P\ /' : C" ^ 

for which there exists an isomorphism h : C —>■ C such that f = f o h. 
Similarly, we define automorphisms of / as an automorphisms h : C ^ C 
such that f = f o h. We will see that, with a very rare exception, Hurwitz 
coverings have only trivial automophisms. 

By definition, the Hurwitz number Uni g{fi) is the number of isomorphism 
classes of Hurwitz coverings with given positions of branch points. In the 
special case when such a covering has a nontrivial automorphism, it should 
be counted with multiplicity |. 

2.4 

The Hurwitz problem can be restated as a problem about factoring permu- 
tations into transpositions. This goes as follows. 

Let us pick a point x G which is not a ramification point. Then, by 
basic topology, all information about the covering is encoded in the homo- 
morphism 

7ri(P"^ \ {ramification points}, x) Aut f~^{x) = S{d) . 

The identification of Aut/^^(x) with S{d) here is not canonical, but it is 
convenient to pick any one of the d\ possible identifications. Then, by con- 
struction, the loop around oo goes to a permutation s G S{d) with cycle type 
/i and loops around finite ramification points correspond to some transposi- 
tions ti, . . . ,tr in S{d). 

The unique relation between those loops in tti becomes the equation 

ti---tr = s. (10) 

This establishes the equivalence of the Hurwitz problem with the problem of 
factoring general permutations into transpositions (up to conjugation, since 
we picked an arbitrary identification of Aut/~^(a;) with S{d)). More pre- 
cisely, the Hurwitz number Hurg(/x) is the number (up to conjugacy, and 
possibly with an automorphism factor) of factorization of the form (jl(J|) that 
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correspond to a connected branched covering. A branched covering is con- 
nected when we can get from any point of f~^{x) to any other point by the 
action of the monodromy group. Thus, the transpositions ti, . . . ,t„ have to 
generate a transitive subgroup of S{d), which is then automatically forced to 
be the whole of S{d). 

The fact that ti,...,tn generate S{d) greatly constraints the possible 
automorphisms of /. Indeed, the action of any nontrivial automorphism on 
f~^{x) has to commute with ti, . . . ,tn, and hence with S{d), which is only 
possible a d = 2. 

By the usual inclusion-exclusion principle, it is clear that one can go back 
and forth between enumeration of connected and possibly disconnected cov- 
erings. Thus, the Hurwitz problem is essentially equivalent to decomposing 
the powers of one single element of the class algebra of the symmetric group, 
namely of the conjugacy class of a transposition 

E ('3) (11) 

l<i<j<d 

in the standard conjugacy class basis. There is a classical formula, going 
back to Frobenius, for all such expansion coefficients in terms of irreducible 
characters. The character sums that one thus obtains can be viewed as finite 
analogs of Hermitian matrix integrals, with the dimension of a representation 
A playing the role of the Vandermonde determinant and the central character 
of (fTTj) in the representation A playing the role of the Gaussian density, see, 
for example [TT| IT^ for a further discussion of properties of such sums. 

2.5 

For us, the crucial property of the Hurwitz problem is its connection with the 
intersection theory on the Dehgne-Mumford spaces Mg,„. This connection 
was discovered, independently, in j3j and jH] , the latter paper containing the 
following general formula 

|Aut/i|ll/i,! y^VC,.^ Ui^-^^^^^) ^ ' 

where r is number of branch points given by ©, = is the length of 
the partition /i, Aut yU is the stabilizer of the vector /x in S'(n), 

A, G (Mg,„) , 2 = !,...,(?, 
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are the Chern classes of the Hodge bundle over „ (it is not important 
for what follows to know what this is), and finally, the denominators are 
supposed to be expanded into a geometric series 

, ^ , = 1 + f^i^i + f^l^i +■■■ , (13) 

which terminates because ijji e H'^ {^9,n) is nilpotent. 

In particular, the integral in the ELSV formula ()12|) is a polynomial in 
the /Xj's. The monomials in this polynomial are obtained by picking a term 
in the expansion (fT!^ for each i = 1, . . . ,n and then adding a suitable A-class 
to bring the total degree to the dimension of Mg,„. It is, therefore, clear 
that the top degree term of this polynomial involves only intersections of the 
V'-classes and no A-classes. That is, 

= X] n ^^i' ^^'^i • • • ^fcn) + lower degree . (14) 

fci+---+fc„=3g-3+n 

These top degree terms are precisely the numbers (jSJ that we want to un- 
derstand. 




2.6 

A natural way to infer something about the top degree part of a polynomial 
is to let its arguments go to infinity. The behavior of the prefactors in (fT^ 
is given by the Stirling formula 

— - ~ —= , m — > cxD . 

ml V27rm 

Let iV be a large parameter and let /ij depend on in such a way that 

— ^Xi, 2 = l,...,n, A^-^oo, 

where finite. We will also additionally assume that all /Xj's are 

distinct and hence | Aut/i| = 1. Then by p4|) and the Stirling formula, we 
have the following asymptotics of the Hurwitz numbers: 

1 Hurg(/i) 



1 TJ 



(27r)"/2 



E {rk.---nJ=:H,ix). (15) 
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It is convenient to Laplace transform the asymptotics Hg{x). Since 



we get 

e-^- H,{x) dx= (-^-^ r,Jl[ , (16) 

'^'*>o ki+---+k„=3g-3+n ^ 

which up to the following change of variables 

Zi V^2Sj , i — 1 , . . . , 77. , 

is precisely the Kontsevich generating function for the numbers 

Thus, we find ourselves in situation which looks rather comfortable: the 
generating function that we seek to compute is not only related to a specific 
enumerative problem but, in fact, it is the Laplace transform of the asymp- 
totics in that enumerative problem. People who do enumeration know that 
asymptotics tends to be simpler than exact enumeration and, usually, the 
Laplace (or Fourier) transform of the asymptotics is the most natural thing 
to compute. 

This general philosophy is, of course, only good if we can find a handle 
on the Hurwitz problem. In the following subsection, we will discuss a re- 
statement of the Hurwitz problem in terms of enumeration of certain graphs 
on genus g surfaces that we call branching graphs. This description will 
turn out to be particularly suitable for our purposes (which may not be a 
huge surprise because, after all, Kontsevich's formula (jHl) is stated in terms 
of graphs on surfaces). 

2.7 

A very classical way to study branched coverings is to cut the base into 
simply-connected pieces. Over each of the resulting regions the covering be- 
comes trivial, that is, consisting of d disjoint copies of the region downstairs, 
where d is the degree of the covering. The structure of the covering is then 
encoded in the information on how those pieces are patched together up- 
stairs. Typically, this gluing data is presented in the form of a graph, usually 
with some additional labels etc. 
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There is, obviously, a considerable flexibility in this approach and some 
choices may lead to much more convenient graph enumeration problems than 
the others. For the Hurwitz problem, we will follow the strategy from pp, 
which goes as follows. 

Let 

/ : C ^ 

be a Hurwitz covering with partition /i and genus g. In particular, the number 
r of finite ramification points of / is given by the formula Q. Without loss 
of generality, we can assume these ramification points to be rth roots of unity 
in C. Let us cut the base P^ along the unit circle S = {\z\ = 1}, that is, let 
us write 

= D_USUD+ 

where 

Di = {\z\^l} 

are the Southern and Northern hemisphere in Figure IT^ respectively. 



CO 




Figure 16: A Hurwitz covering / : S2 — P 



Since the map / is unramified over D_, its preimage f^^{DJ) consists of 
d disjoint disks. Their closures, however, are not disjoint: they come together 
precisely at the critical points of /. By construction, critical points of / are 
in bijection with its critical values, that is, with the rth roots of unity in P^. 
Thus, the the set f~^{DJ) C C looks like the structure in Figure fTTI This 
structure is, in fact, a graph F embedded in a genus g surface. Its vertices 
are the components of f^^{D_) and its edges are the critical points of / that 
join those components together. In addition, the edges of F (there are r of 
them) are labeled by the roots of unity. 

This edge-labeled graph F C S^, is subject to some additional constraints. 
First, the cyclic order of labels around any vertex should be in agreement 
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Figure 17: Preimage / ^{D_) on S2 



with the cychc order of roots of unity. Next, the complement of F consists 
of n topological disks, where n is the length of the partition /i. Indeed, the 
complement of F corresponds to f~^{D^) and 2 = cxd is the only ramification 
point in D+. The connected components of f~^{D^) thus correspond to 
parts of /i. 

The partition /i can be reconstructed from the edge labels of F as follows. 
Pick a cell Ui in f^^{D+). The length of the corresponding part /ij of /i is 
precisely the number of times the map / wraps the boundary dUi around the 
circle 5*. As we follow the boundary dUi, we see the edge labels appear in 
a certain sequence. As we complete a full circle around dUi, the edge labels 
will make exactly fii turns around S. It is natural to call this number /ij the 
perimeter of the cell Ui. This perimeter is (27r)~^ times the sum of angles 
between pairs of the adjacent edges on dUi, where the angle is the usual angle 
in (0, 27r) between the corresponding roots of unity, see Figure IT71 

We call a edge-labeled embedded graph F as above a branching graph. 
By the above correspondence, the number Hurg(/i) is the number of genus g 
branching graphs with n cells of perimeter /ii, . . . , As usual, in the trivial 
d = 2 case, those graphs have to be counted with automorphism factors. 

It is this definition of Hurwitz numbers that we will use for the asymptotic 
analysis in the next lecture. 
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2.8 



It may be instructive to consider an example of how this correspondence 
between coverings and graphs works. Consider the covering corresponding 
to factorization 

(12) (13) (24) (14) (13) = (1243) 

of the form ()10|) . The degree of this covering is d = 4, it has r = 5 ramification 
points, and the monodromy n = (4) around infinity. It follows that its genus 
is g = 1. Let us denote the five finite ramification points by 

{a,b,c,d,e} = {l,e2"/^...,e^"/^}. 

The preimage of Z)_ on the torus Si consists of 4 disks and the momodromies 
tell us which disk is connected to which at which critical point: for example, 
at the critical point lying over a, the 1st disk is connected to the 2nd disk. 
This is illustrated in Figure ITSl where, among the 3 preimages of any critical 
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Figure 18: Preimage of / ^{DS) on the torus Si 

value, the one which is a critical point is typeset in boldface. Clearly, any 
disk in f~^{DJ) has the alphabet {a, 6, c, d, e} going counterclockwise around 
its boundary and, in particular, the cyclic order of the critical values on its 
boundary is in agreement with the orientation on Si. 

Observe that the preimage /~^(Z)+) is one cell whose boundary is a 4-fold 
covering of the equator. In particular, the alphabet {a, b, c, d, e} is repeated 
4 times around the boundary of /^^ (/)+). Finally, Figure ^1 shows the 
branching graph translation of Figure ^1 
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Figure 19: Branching graph corresponding to Figure^] 

2.9 

Finally, a few remarks about how one can prove a formula like ()12|1 . This 
will necessary be a very sketchy account; the actual details of the proof can 
be found in [71 [Hj , as well as in the original paper j2] . 

As mentioned before, the numbers like Hurg(/i) a special case of in the 
integrals in the Gromov-Witten theory of P^, that is, certain intersections on 
the Kontsevich moduli space Mg,d,„(P^) of stable degree d maps 

/ : C ^ 

from a varying n-pointed genus g domain curve C to the fixed target curve 
P^ 

Since such a map can be composed with any automorphism of P^, we have 
a C^-action on Mg,d,n(P^)- A theory due to Graber and Pandharipande [Hj 
explains how to localize the integrals in Gromov-Witten theory to the fixed 
points of the action of the torus C^. These fixed point loci in 'Mg^d,n{^^) are, 
essentially, products of Deligne-Mumford spaces Mg. „^ for some QiS and n^'s. 
Indeed, only very few maps are fixed by the action of the torus. Namely, for 
the standard -action on P^ and an irreducible domain curve C the only 
choices are the degree constant maps to {0, oo} = (P^) or the degree d 
map 

In general, the domain curve is allowed to be reducible, but still any torus- 
invariant map has to be of the above form on each component Ci of C. Once 
all discrete invariants of the curve C are fixed (that is, the combinatorics 
of its irreducible components, their genera and numbers of marked points 
on them) the remaining moduli parameters are only a choice of a bunch of 
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curves to collapse plus a choice of where to attach the non-collapsed P^'s to 
them. That is, the torus-fixed loci are products of Deligne-Mumford spaces, 
modulo possible automorphisms of the combinatorial structure. 

In this way integrals in the Gromov-Witten theory of can be reduced, 
at least in principle, to computing intersections on M^^ri. An elegant local- 
ization analysis leading to the ELSV formula is presented in [7j, see also 



3 Asymptotics in Hurwitz problem 
3.1 

Our goal now is to see how the Laplace transform p6p of the asymptotics (jlSp 
in the Hurwitz problem turns into Kontsevich's combinatorial formula (jH)). 
The formulation of the Hurwitz problem in terms of branching graphs, see 
Section 12. 7| looks promising. Indeed, a branching graph F is by definition 
embedded in a topological genus g surface Eg and and it cuts T,g into n 
cells. Here the numbers g and n are the same as the indices in Mg^„, on 
the intersection theory on which we are trying to understand. Similarly, in 
Kontsevich's formula we have a graph G embedded into and cutting it 
into n cells. This graph G, however, is a more modest object: it does not 
have any edge labels and it is allowed to have only 3-valent vertices. 

Recall that we denote by „ the set of all possible 3-valent graphs as in 
Kontsevich's formula (jHl). Let us introduce two larger sets 

(~3 (- (~>3 (- r 

on which, by definition, the 3-valence condition is weakened to allow vertices 
of valence 3 or more, and dropped altogether, respectively. The elements of 
G^_^ can be obtained from elements of G^ ,„ by contracting some edges. In 
particular, the set is still a finite set. Similarly, denote by H^^^ the set 
of all branching graphs with given genus g and perimeter partition /i. Our 
first order of business is to construct a map 



which we call the homotopy type map. This map is the composition of the 
map 

l_l , f 
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which simply forgets the edge labels with the map 

r 

which does the following. First, we remove all univalent vertices together 
with the incident edge. After that, we remove the all remaining 2-valent 
vertices joining their two incident edges. What is left, by construction, has 
only vertices of valence 3 and higher and still cuts into n cells. 

We remark that in the two exceptional cases {g,n) = (0, 1), (0,2), which 
correspond to unstable moduli spaces, what we get in the end (a point and 
a circle, respectively) is not really an element of G^^. In all other cases, 
however, we do get an honest element of G^^. Figure EOl illustrates this 
procedure applied to the branching graph from Figure IT7I 




Figure 20: The homotopy type of the branching graph from Figure IT7I 



3.2 

Now let us make the following simple but important observation. Since 
the set G|^^ is finite and we are interested in the asymptotics of Ruiglfj,) 
as /i — cx) while keeping g and n fixed, we can just do the asymptotics 
separately for each homotopy type and then sum over all possible homotopy 
types. The Laplace transform will then be also expressed as a sum over 
all corresponding homotopy types G in G|^. 

We now claim that not only Kontsevich's combinatorial formula (jHl) is 
the Laplace transformed asymptotics ()16p but, in fact, the summation over 
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G G G^^ in Kontsevich's formula corresponds precisely to summation over 
possible homotopy types. Since there are non-trivalent homotopy types, im- 
plicit in this claim is the statement that non-trivalent homotopy types do not 
contribute to asymptotics. 

3.3 

What do we need to do to get the asymptotics of the number of branching 
graphs of a given homotopy type G ? What would suffice is to have a simple 
way to enumerate all such branching graphs. To enumerate all branching 
graphs with given homotopy type G, we need to retrace the steps of the 
homotopy type map. Imagine that the homotopy type graph G is a fossil 
from which we want to reconstruct some prehistoric branching graph F. What 
are the all possible ways to do it ? 

The answer to all these rhetoric questions is quite simple. It is easy to see 
that the preimage of any edge in G is some subtree in the original branching 
graph r. In addition, all these trees carry edge-labels which were erased by 
the homotopy type map. Thus, for any edge e of G, we need to take a tree T^. 
whose edges are labeled by roots of unity. In particular, there is a canonical 
way to make this tree planar, that is, embed it in the plane in such a way that 
the cyclic order of edges around each vertex agrees with the order of their 
labels. In particular, each such tree is a branching tree, that is, it satisfies 
the = (0, 1) case of our definition of a branching graph^. 

Next, these trees are to be glued into the graph V by identifying some of 
their vertices, as in Figure |^ This means that each of these branching trees 
carries two special vertices, which we call its root and top. These special 
vertices of mark the places where is attached to the other trees in F. 
We will call such a branching tree with two marked vertices an edge tree. 

3.4 

Now we have a procedure which from a homotopy type G and a collection 
of edge trees {Tg} with distinct labels assembles a branching graph F. This 
procedure, which we will call assembly, does have some imperfections. Those 
imperfections will be discussed momentarily, but first we want to make the 
following important observation. 

■^A small and inessential detail is that the labels Te are taken from a larger set of roots 
of unity 
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Figure 21: Assembling a branching graph from edge trees 

Since the homotopy type graph G is something fixed and finite, the whole 
asymptotics of the branching graphs lies in the edge trees. For a large random 
branching graph F, those edge trees will be large random trees. This is how 
the theory of random trees enters the scene. Fortunately, a large random tree 
is a very well studied and a very nicely behaved object, see for example ^H] for 
a particularly enjoyable introduction. It turns out that all the information we 
need about random trees is either already classical or can be easily deduced 
from known results. 

In fact, all required knowledge about random trees can be quite easily 
deduced (as was done in ^3]) from the first principles, which in this case, 
is the following formula going back to Cayley J7j. Consider all possible 
trees T with the vertex set {l,...,m}. For any such tree T, we have a 
function valxii) which takes the vertex z = 1, . . . , m to its valence in T. The 
information about all vertex valences in all possible trees T is encoded in the 
following generating function 

J2 ^r'"^'^ ■ ■ ■ c'"^'"^ = z,---zUzi + --- + z^r-' . (17) 

T 

A probabilistic restatement of this result is the following. The valence valT(^) 
is the number of edges of T incident to the vertex i. Let us cut all edges 
in half; since there were m — 1 edges of T, we get 2m — 2 half edges. The 
formula (fTTj) says that the same distribution of half-edges can be obtained 
as follows: give every vertex a half-edge and the remaining m — 2 edges just 
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throw at the vertices randomly hke darts. 

What is then the valence of a given vertex in a random tree T ? It is 1 
for the half edge allowance that it always gets plus its share in the random 
distribution of m — 2 darts among m targets. As m —>■ oo, this share goes to 
a Poisson random variable with mean 1. In other words, as m — oo we have 

Prob{valT(2) = i;} ^ -^^^^-^ , v = l,2,.... (18) 

For different vertices, their valences become independent in the m oo 
limit. 

Also, setting all variables in (jl7|) to 1 we find that the total number of 
trees with vertex set {!,..., m} is m*""^. 



3.5 

Now it is time to talk about how the assembly map differs from being one- 
to-one (it clear that it is onto). 

First, it may happen that the cyclic order of edge labels is violated at 
one of the vertices of G where we patch together different edge trees. If this 
is the case, we simply declare the assembly to be a failure and do nothing. 
The probability of such an assembly failure in the large graph limit can be 
computed as follows. Suppose that we need to glue together three vertices 
with valences vi, V2, and v^. From (fTHj) . the chance of seeing these particular 
valences is 

e-3 



{vi - i)\{v2 - mv^ - 1)\ ■ 

On the other hand, the conditional probability that the edge labels in the 
resulting graph are cyclically ordered, given that they were cyclically ordered 
before gluing is easily seen to be 

{v,-iy.{v2-mvs-i)\ 

{Vi + V2 + Vs - 1)\ 

Hence the success rate of the assembly at a particular trivalent vertex is 

1 



y 

{Vi + V2 + V3 - ly. 
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Assembly failures at distinct vertices being asymptotically independent events, 
this goes into an overall factor and, eventually in the prefactor in (jSI). 

At this point it should be clear that there in no need to consider non- 
trivalent vertices. Indeed, a homotopy type graph with a vertex of valence 
> 4 can be obtained from a trivalent graph by contracting some edges, hence 
corresponds to the case when some of the edge graphs are trivial. It is obvi- 
ous that the chances that a large random tree came out empty are negligible. 
Hence, nontrivalent graphs make indeed no contribution to the asymptotics 
and can safely be ignored. 

3.6 

The second (minor) issue with the assembly map is that we can get the same, 
that is, isomorphic branching graphs starting from different collections of the 
edge trees. This happens if the homotopy type graph G has nontrivial auto- 
morphisms. It is clear that the group Aut(G') acts on edges of G and, hence, 
acts by permutations on collections of edge trees preserving the isomorphism 
class of the assembly output. It is also clear that the chance for a large edge 
tree to be isomorphic to another edge tree (or to itself with root and top 
permuted) is, asymptotically, zero. Hence almost surely this Aut(G') action 
is free and hence there is an overcounting of branching graphs by exactly a 
factor of I Aut(G')|. This explains the division by | Aut(G')| in (jSI). 

3.7 

Now, after explaining the summation over trivalent graphs and the auto- 
morphism factor in (jSI), we get to the heart of Kontsevich's formula — the 
product over the edges. 

It is at this point that the convenience (promised in Section 17!^ of work- 
ing with the Laplace transform ()16|) rather then the asymptotics fllSp itself 
can be appreciated. We will see shortly that, asymptotically, the cell perime- 
ters of a branching graph F assembled from a 3-valent graph G and bunch 
of random edge trees {Tg} is a sum of independent contributions from each 
edge of G. This makes the Laplace transform (fT^ factor over the edges of 
G as in (jH)). To justify the above claim, we need to take a closer look at a 
large typical edge tree. 
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3.8 



Let T be an edge tree. It has two marked vertices, root and top; let us call 
the path joining them the trunk of T. The tree T naturally splits into 3 
parts: the root component, the top component, and the trunk component, 
according to their closest trunk point. This is illustrated in Figure |221 



trunk component . ' ^ 

, top 




Figure 22: The components of an edge tree 

Figure |221 may give a wrong idea of the relative size of these components 
for a typical large edge tree. Let M — oo be the size (e.g. the number 
of vertices) in T. It is known and can be without difficulty deduced from 
fll7|) (see for example ^3]) that the size M distributes itself among the three 
components of T in the M — > cxd limit as follows. 

First, the size of the root and the top component stays finite in the 
M — > cxD limit. In fact, it goes to the Borel distribution, given by the following 
formula 

l^k 1 

Prob(A;) = , A; =1,2,.... 

k\ 

Second, the typical size of the trunk is of order a/M. More precisely, scaled 
by a/M, the trunk size distribution goes to the Rayleigh distribution with 
density 

xe"^ /^dx, X G (0, oo) . 

For our purposes, however, it only matters that the size of all these parts is 
o(M) as M cx). 
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The overwhelming majority of vertices he, therefore, somewhere in the 
branches of the trunk component of T. What is very important is that, after 
assembly, any such vertex will find itself completely surrounded by a unique 
cell. As a result, it will contribute exactly 1 to that cell's perimeter. What 
this analysis shows it that, asymptotically, the cell perimeters are determined 
simply by the the number of such interior trunk vertices ending up in a given 
cell, all other contributions to perimeters being o{M). It should be clear that 
such contributions of distinct edges of G are indeed independent, leading to 
the factorization in (jH}. 

3.9 

What remains is to determine is what the edge factors are, that is, to de- 
termine the actual contribution of an edge tree T to the perimeters of the 
adjacent cells. 

All we need to know for this is to know how vertices in the trunk compo- 
nent distribute themselves between the two sides of the trunk, as in Figure 
1221 One shows, see |T3j and below, that the fraction of the vertices that 
land on a given side of the trunk is, asymptotically, uniformly distributed 
on [0, 1]. This reduces the computation of the edge factor to computing one 
single integral. That computation will be presented in a moment, after we 
review the knowledge that we have accumulated so far. 

3.10 

Let G be a 3-valent map with n cells. It has 

\E{G)\ = 6g-6 + 3n 

edges and 

\V{G)\ = 4^ - 4 + 2n 
vertices, which follows from the Euler characteristic equation 

\ViG)\-\EiG)\+n = 2-2g 

combined with the 3-valence condition 3|\^(G)| = 2\E{G)\. 

Let e G E{G) be an edge of G and let be the corresponding edge tree. 
Let de be the number of vertices of Tg. Ignoring the few vertices on the trunk 
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itself, the vertices of Te distribute themselves between the two sides of the 
trunk of Tg. Let's say that Pe vertices are on the one side and define the 
number by 

Pe + qe = de. (19) 

It is clear that qe is the approximate number of vertices on the other side of 
the trunk. We call the numbers Pe and qe the semiperimeters of the tree T^. 

The basic question, which we now can answer in the large graph limit, 
is how many branching graphs T have given semiperimeters {{pe,qe)}eeE{G)- 
This distribution can be computed asymptotically as follows. 



3.11 

First, there are some overall factors that come from automorphisms of G 
and the assembly success rate. Recall that in Section 13.51 we saw that the 
assembly success rate is e~^'^''*^^'2~l^'^'^)L 

Second, for every edge e G E{G) we need to pick an edge tree Tg with de 
vertices. As we already learned from (jl7|) . the number of vertex-labeled trees 
with de vertices is 6?^*="^. Vertex labels can be traded for edge labels at the 
expense of the factor de\/{de — 1)! = de, hence there are df^'^ edge labeled 
trees with de vertices. The choice of the root vertex brings in additional factor 
of de choices. Once the root is fixed, the condition (fT^ dictates the position 
of the top, so there is no additional freedom in choosing it. To summarize, 
there are ~ df'^"^ edge trees with given semiperimeters Pe and qe- 

Third, the edge labels of F is a shuffle of edge labels of the trees Tg. Let 



J2ide 
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be the total number of edges in F (and, hence, also the total number of simple 
branch points in the Hurwitz covering corresponding to F). Obviously, there 
are 

■ TTT (20) 



ways to shuffle edge labels of {Te} into edge labels of F. 

Putting it all together, we obtain the following approximate expression for 
the number of branching graphs with given semiperimeters {{pe, (le)}e<^E{G) 

11 ZTTrTTi TXThTTtTT 11 /Kz j3/2 ' (21) 



J^,,(d.-iy. |Aut(G)| J,l,^^V2^£'^ 
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where 

d=\V{T)\=Y,de-2\V{G)\ 

e 

is the degree of the corresponding Hurwitz covering and the RHS of (j2H) is 
obtained from the LHS by the Stirhng formula. 

Note that the factor r! e'^ precisely cancels with pref actor in ()15|). 

3.12 

Since the cell perimeters of F are the sums of edge tree semiperimeters along 
the boundaries of the cells, the computation of the Laplace transform ()16|] 
indeed boils down to the computation of a singe edge factor 

p-psi-qs2 11/" dr 



27r JJp,q>o {p + qf^^ Si - S2 



2;3/2 



1 r (-i) vsT- 



2-n ' si - S2 



where we set x = p + q. 

Recall that the relation between the Laplace transform variables Sj in 
fll6|) and the variables Zi in Kontsevich's generation function [7| is 

Zi '^'Zsi , i — 1 , . . . , ?7, . 

Thus we get indeed the LHS of (jHl), including the correct exponent of 2, 
which is 

\E{G)\-\V{G)\ = 2g-2 + n. 
This completes the proof of Kontsevich's formula (jH)). 



4 Remarks 
4.1 

Since random matrices are the common thread of many talks at this school, 
let us point out various connections between moduli of curves and random 
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matrices. As we already discussed, the original KdV conjecture of Witten 
was based on physical parallelism between intersection theory on Mg,„ and 
the double scaling limit of the Hermitian 1-matrix model. Despite many 
spectacular achievements by physicists as well as mathematicians, this dou- 
ble scaling seems to remain a source of serious mathematical challenges, in 
particular, it appears that no direct mathematical connection between it and 
moduli of curves is known. On the other hand, there is a very direct connec- 
tion between what we did and another, much simpler, matrix model, namely, 
the edge scaling of the standard GUE model. This connection goes as follows. 

Recall that by Wick formula the coefficients of the 1/N expansion of the 
following N X N Hermitian matrix integral 



are the numbers of ways to glue a surface of given topology from m polygons 
with perimeters ki, . . . ,kn- The double scaling limit of the 1-matrix model 
is concerned with gluing a given surface out of a very large number of small 
pieces. An opposite asymptotic regime is when the number m of pieces stays 
fixed while their sizes ki go to infinity. Since for large k the trace tr picks 
out the maximal eigenvalues of H, this asymptotic regime is about largest 
eigenvalues of a Hermitian random matrix. In the large limit, the distribu- 
tion of largest eigenvalues of H is well known to be the Airy ensemble. This 
edge scaling random matrix ensemble is very rich, yet susceptible to a very 
detailed mathematical analysis. In particular, the individual distributions of 
eigenvalues were found by Tracy and Widom in JHl • They are given in terms 
of certain solutions of the Painleve II equation. 

The connection between GUE edge scaling and what we were doing is 
the following. If one takes a branching graph as in Figure El and strips it 
off its edge labels, one gets a map on genus with a few cells of very large 
perimeter, that is, an object of precisely the kind enumerated by (f^^ in the 
edge scaling regime. We argued that almost all vertices of a large branching 
r graph are completely surrounded by a unique cell, hence contribute exactly 
1 to that cell's perimeter regardless of the edge labels. This shows that edge 
labels play no essential role in the asymptotics, thus establishing a direct 
connection between Hurwitz numbers asymptotics and GUE edge scaling. A 
similar direct connection can be established in other situations, for example, 
between GUE edge scaling and distribution of long increasing subsequences 




1 



(22) 
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in a random permutation, see jTU]. Since a great deal is known about GUE 
edge scaling, one can profit very easily from having a direct connection to it. 
In particular, one can give closed error-function-type formulas for a natural 
generating functions (known as n-point functions) for the numbers (0), see 

There exists another matrix model, namely the Kontsevich's matrix model 
0, specifically designed to reproduce the graph summation in (jHI) as its di- 
agrammatic expansion. Once the combinatorial formula (jH)) is established, 
this Kontsevich's model can be used to analyze it, in particular, to prove the 
KdV equations, see P and also 0. 

Alternatively, the KdV equations can be pulled back from the GUE edge 
scaling (where they have been studied in depth by Adler, Shiota, and van 
Moerbeke) via the above described connection, see the exposition in fT^ . 

4.2 

In our approach, the intersections (0), the combinatorial formula (jH)), the 
KdV equations etc. appear through the asymptotic analysis of the Hurwitz 
problem. The ELSV formula ()12|). which is the bridge between enumeration 
of branched coverings and the intersection theory of „ is, on the hand, an 
exact formula. It is, therefore, natural to ask for more exact bridges between 
intersection theory, combinatorics, and integrable systems. 

After the moduli space Mg,„ of stable curves, a natural next step is the 
Gromov-Witten theory of P^, that is, the intersection theory on the moduli 
space 'Mg^n{F^, d) of stable degree d maps 

from an n-pointed genus g curve C to the projective line P^. More generally, 
one can replace P^ by some higher genus target curve X. It turns out, see 
[T^ . that there is a simple dictionary, which we call the Gromov-Witten- 
Hurwitz correspondence, between enumeration of branched coverings of P^ 
and Gromov-Witten theory of P^. This correspondence naturally connects 
with some very beautiful combinatorics and integrable systems, the role of 
random matrices now being played by random partitions. The connection 
with the integrable systems is seen best in the equivariant Gromov-Witten 
theory of P^, where the 2-Toda lattice hierarchy of Ueno and Takasaki plays 
the role that KdV played for Mg^„. 
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